NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Exporting Ada Software to Python and Julia

https://doi.org/10.1145/3577949.3577961

Verschelde, Jan (December 2022, ACM SIGAda Ada Letters)

The objective is to demonstrate the making of Ada software available to Python and Julia programmers using GPRbuild. GPRbuild is the project manager of the GNAT toolchain. With GPRbuild the making of shared object files is fully automated and the software can be readily used in Python and Julia. The application is the build process of PHCpack, a free and open source software package to solve polynomial systems by homotopy continuation methods, written mainly in Ada, with components in C++, available at github at https://github.com/janverschelde/PHCpack.
more » « less
Full Text Available
Least Squares on GPUs in Multiple Double Precision

Verschelde, Jan (April 2022, 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW))

This paper describes the application of the code generated by the CAMPARY software to accelerate the solving of linear systems in the least squares sense on Graphics Processing Units (GPUs), in double double, quad double, and octo double precision. The goal is to use accelerators to offset the cost overhead caused by multiple double precision arithmetic. For the blocked Householder QR and the back substitution, of interest are those dimensions at which teraflop performance is attained. The other interesting question is the cost overhead factor that appears each time the precision is doubled. Experimental results are reported on five different NVIDIA GPUs, with a particular focus on the P100 and the V100, both capable of teraflop performance. Thanks to the high Compute to Global Memory Access (CGMA) ratios of multiple double arithmetic, teraflop performance is already attained running the double double QR on 1,024-by-1,024 matrices, both on the P100 and the V100. For the back substitution, the dimension of the upper triangular system must be as high as 17,920 to reach one teraflops on the V100, in quad double precision, and then taking only the times spent by the kernels into account. The lower performance of the back substitution in small dimensions does not prevent teraflop performance of the solver at dimension 1,024, as the time for the QR decomposition dominates. In doubling the precision from double double to quad double and from quad double to octo double, the observed cost overhead factors are lower than the factors predicted by the arithmetical operation counts. This observation correlates with the increased performance for increased precision, which can again be explained by the high CGMA ratios.
more » « less
Full Text Available
Accelerated Polynomial Evaluation and Differentiation at Power Series in Multiple Double Precision

https://doi.org/10.1109/ipdpsw52791.2021.00111

Verschelde, Jan (May 2021, 2021 IEEE International Parallel and Disributed Processing Symposium Workshops (IPDPSW))
null (Ed.)
Full Text Available
Robust Numerical Tracking of One Path of a Polynomial Homotopy on Parallel Shared Memory Computers

https://doi.org/10.1007/978-3-030-60026-6_33

Telen, Simon; Van Barel, Marc; Verschelde, Jan (October 2021, Computer Algebra in Scientific Computing. CASC 2020. Lecture Notes in Computer Science, vol 12291)
null (Ed.)
Full Text Available
Parallel Software to Offset the Cost of Higher Precision

https://doi.org/10.1145/3463478.3463483

Verschelde, Jan (April 2021, ACM SIGAda Ada Letters)
null (Ed.)
Hardware double precision is often insufficient to solve large scientific problems accurately. Computing in higher precision defined by software causes significant computational overhead. The application of parallel algorithms compensates for this overhead. Newton's method to develop power series expansions of algebraic space curves is the use case for this application.
more » « less
Full Text Available
Numerical Schubert calculus via the Littlewood-Richardson homotopy algorithm

https://doi.org/10.1090/mcom/3579

Leykin, Anton; Martín del Campo, Abraham; Sottile, Frank; Vakil, Ravi; Verschelde, Jan (May 2021, Mathematics of Computation)
null (Ed.)
Full Text Available
A Robust Numerical Path Tracking Algorithm for Polynomial Homotopy Continuation

https://doi.org/10.1137/19M1288036

Telen, Simon; Barel, Marc Van; Verschelde, Jan (January 2020, SIAM Journal on Scientific Computing)
null (Ed.)
Full Text Available

Search for: All records